Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 38659 |
| Missing cells | 457949 |
| Missing cells (%) | 37.0% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 9.4 MiB |
| Average record size in memory | 256.0 B |
Variable types
| Numeric | 14 |
|---|---|
| Text | 6 |
| Categorical | 11 |
| Unsupported | 1 |
domains_count has constant value "" | Constant |
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
CHROM is highly overall correlated with seq_region_name and 3 other fields | High correlation |
POS is highly overall correlated with AF and 2 other fields | High correlation |
1000gp3_eur_af is highly overall correlated with clinvar_id and 10 other fields | High correlation |
clinpred_rankscore is highly overall correlated with mutationassessor_rankscore and 4 other fields | High correlation |
clinvar_id is highly overall correlated with 1000gp3_eur_af and 5 other fields | High correlation |
gnomad_exomes_non_cancer_nfe_af is highly overall correlated with 1000gp3_eur_af and 10 other fields | High correlation |
mutationassessor_rankscore is highly overall correlated with clinpred_rankscore and 5 other fields | High correlation |
mutationtaster_converted_rankscore is highly overall correlated with clinpred_pred and 1 other fields | High correlation |
polyphen2_hdiv_rankscore is highly overall correlated with clinpred_rankscore and 5 other fields | High correlation |
sift_converted_rankscore is highly overall correlated with 1000gp3_eur_af and 8 other fields | High correlation |
pubmed_count is highly overall correlated with 1000gp3_eur_af and 4 other fields | High correlation |
frequencies_af is highly overall correlated with 1000gp3_eur_af and 8 other fields | High correlation |
frequencies_gnomadg_nfe is highly overall correlated with 1000gp3_eur_af and 5 other fields | High correlation |
seq_region_name is highly overall correlated with CHROM and 3 other fields | High correlation |
AF is highly overall correlated with POS and 1 other fields | High correlation |
GENEINFO is highly overall correlated with CHROM and 7 other fields | High correlation |
TISSUE is highly overall correlated with clinpred_rankscore and 2 other fields | High correlation |
CTYPE is highly overall correlated with CHROM and 7 other fields | High correlation |
GT is highly overall correlated with 1000gp3_eur_af and 2 other fields | High correlation |
clinpred_pred is highly overall correlated with clinpred_rankscore and 7 other fields | High correlation |
strand is highly overall correlated with CHROM and 8 other fields | High correlation |
clin_sig_allele is highly overall correlated with clinpred_pred | High correlation |
variant_class is highly overall correlated with 1000gp3_eur_af and 8 other fields | High correlation |
AF is highly imbalanced (98.0%) | Imbalance |
TISSUE is highly imbalanced (59.5%) | Imbalance |
clinpred_pred is highly imbalanced (71.3%) | Imbalance |
clin_sig_allele is highly imbalanced (90.8%) | Imbalance |
variant_class is highly imbalanced (65.6%) | Imbalance |
RIS. is highly imbalanced (97.1%) | Imbalance |
AF has 17793 (46.0%) missing values | Missing |
GENEINFO has 17793 (46.0%) missing values | Missing |
1000gp3_eur_af has 30399 (78.6%) missing values | Missing |
clinpred_pred has 29208 (75.6%) missing values | Missing |
clinpred_rankscore has 29208 (75.6%) missing values | Missing |
clinvar_id has 29549 (76.4%) missing values | Missing |
domains_count has 18486 (47.8%) missing values | Missing |
gnomad_exomes_non_cancer_nfe_af has 30037 (77.7%) missing values | Missing |
mutationassessor_rankscore has 33020 (85.4%) missing values | Missing |
mutationtaster_converted_rankscore has 28988 (75.0%) missing values | Missing |
polyphen2_hdiv_rankscore has 32981 (85.3%) missing values | Missing |
sift_converted_rankscore has 29211 (75.6%) missing values | Missing |
strand has 1973 (5.1%) missing values | Missing |
sift_score has 28997 (75.0%) missing values | Missing |
hgvsc has 2144 (5.5%) missing values | Missing |
clin_sig_allele has 8565 (22.2%) missing values | Missing |
pubmed_count has 28072 (72.6%) missing values | Missing |
frequencies has 38659 (100.0%) missing values | Missing |
frequencies_af has 9481 (24.5%) missing values | Missing |
frequencies_gnomadg_nfe has 9481 (24.5%) missing values | Missing |
variant_class has 1952 (5.0%) missing values | Missing |
seq_region_name has 1952 (5.0%) missing values | Missing |
frequencies is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
clinpred_rankscore has 955 (2.5%) zeros | Zeros |
Reproduction
| Analysis started | 2023-11-24 10:28:18.923735 |
|---|---|
| Analysis finished | 2023-11-24 10:28:46.173317 |
| Duration | 27.25 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
CHROM
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.114928 |
| Minimum | 1 |
|---|---|
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 11 |
| median | 13 |
| Q3 | 17 |
| 95-th percentile | 17 |
| Maximum | 22 |
| Range | 21 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.1385038 |
|---|---|
| Coefficient of variation (CV) | 0.42414646 |
| Kurtosis | -0.44466683 |
| Mean | 12.114928 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.7315642 |
| Sum | 468351 |
| Variance | 26.404221 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 13602 | |
| 17 | 11058 | |
| 2 | 3130 | 8.1% |
| 11 | 2717 | 7.0% |
| 3 | 1840 | 4.8% |
| 7 | 1571 | 4.1% |
| 5 | 1471 | 3.8% |
| 16 | 850 | 2.2% |
| 22 | 635 | 1.6% |
| 8 | 479 | 1.2% |
| Other values (4) | 1306 | 3.4% |
| Value | Count | Frequency (%) |
| 1 | 222 | 0.6% |
| 2 | 3130 | 8.1% |
| 3 | 1840 | 4.8% |
| 4 | 376 | 1.0% |
| 5 | 1471 | 3.8% |
| 7 | 1571 | 4.1% |
| 8 | 479 | 1.2% |
| 10 | 340 | 0.9% |
| 11 | 2717 | 7.0% |
| 13 | 13602 |
| Value | Count | Frequency (%) |
| 22 | 635 | 1.6% |
| 19 | 368 | 1.0% |
| 17 | 11058 | |
| 16 | 850 | 2.2% |
| 13 | 13602 | |
| 11 | 2717 | 7.0% |
| 10 | 340 | 0.9% |
| 8 | 479 | 1.2% |
| 7 | 1571 | 4.1% |
| 5 | 1471 | 3.8% |
POS
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 3431 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55601948 |
| Minimum | 1206466 |
|---|---|
| Maximum | 2.1567462 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 1206466 |
|---|---|
| 5-th percentile | 7578645 |
| Q1 | 32913055 |
| median | 41223094 |
| Q3 | 48023115 |
| 95-th percentile | 1.7892243 × 108 |
| Maximum | 2.1567462 × 108 |
| Range | 2.1446815 × 108 |
| Interquartile range (IQR) | 15110060 |
Descriptive statistics
| Standard deviation | 45465822 |
|---|---|
| Coefficient of variation (CV) | 0.81770195 |
| Kurtosis | 4.238488 |
| Mean | 55601948 |
| Median Absolute Deviation (MAD) | 8308089 |
| Skewness | 2.134477 |
| Sum | 2.1495157 × 1012 |
| Variance | 2.0671409 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 32913055 | 1722 | 4.5% |
| 32915005 | 1722 | 4.5% |
| 32929387 | 1720 | 4.4% |
| 32936646 | 1239 | 3.2% |
| 41244936 | 991 | 2.6% |
| 41223094 | 962 | 2.5% |
| 41234470 | 958 | 2.5% |
| 41244000 | 955 | 2.5% |
| 41245466 | 955 | 2.5% |
| 41244435 | 953 | 2.5% |
| Other values (3421) | 26482 |
| Value | Count | Frequency (%) |
| 1206466 | 1 | < 0.1% |
| 1206566 | 1 | < 0.1% |
| 1207176 | 1 | < 0.1% |
| 1207238 | 19 | |
| 1207280 | 7 | < 0.1% |
| 1218219 | 9 | |
| 1218523 | 9 | |
| 1218587 | 4 | < 0.1% |
| 1218596 | 7 | < 0.1% |
| 1219129 | 15 |
| Value | Count | Frequency (%) |
| 215674619 | 5 | < 0.1% |
| 215674445 | 1 | < 0.1% |
| 215674436 | 47 | |
| 215674376 | 3 | < 0.1% |
| 215674371 | 41 | |
| 215674341 | 43 | |
| 215674323 | 43 | |
| 215674224 | 30 | |
| 215674090 | 43 | |
| 215673948 | 6 | < 0.1% |
REF
Text
| Distinct | 295 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 302.1 KiB |
Length
| Max length | 75 |
|---|---|
| Median length | 1 |
| Mean length | 1.5186632 |
| Min length | 1 |
Characters and Unicode
| Total characters | 58710 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 97 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | G |
|---|---|
| 2nd row | T |
| 3rd row | A |
| 4th row | G |
| 5th row | T |
| Value | Count | Frequency (%) |
| t | 10985 | |
| a | 9939 | |
| g | 8878 | |
| c | 4894 | |
| tt | 407 | 1.1% |
| aa | 187 | 0.5% |
| taa | 80 | 0.2% |
| ttg | 79 | 0.2% |
| at | 67 | 0.2% |
| ctttttttttttttttttt | 61 | 0.2% |
| Other values (285) | 3082 | 8.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 21678 | |
| A | 18962 | |
| G | 10819 | |
| C | 7251 | 12.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 58710 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 21678 | |
| A | 18962 | |
| G | 10819 | |
| C | 7251 | 12.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 58710 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 21678 | |
| A | 18962 | |
| G | 10819 | |
| C | 7251 | 12.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58710 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 21678 | |
| A | 18962 | |
| G | 10819 | |
| C | 7251 | 12.4% |
ALT
Text
| Distinct | 199 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 302.1 KiB |
Length
| Max length | 342 |
|---|---|
| Median length | 1 |
| Mean length | 1.3797046 |
| Min length | 1 |
Characters and Unicode
| Total characters | 53338 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 56 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | A |
|---|---|
| 2nd row | C |
| 3rd row | G |
| 4th row | C |
| 5th row | C |
| Value | Count | Frequency (%) |
| c | 13739 | |
| g | 9290 | |
| a | 7539 | |
| t | 5828 | |
| tt | 114 | 0.3% |
| cacac | 75 | 0.2% |
| ag | 68 | 0.2% |
| tat | 60 | 0.2% |
| tta | 57 | 0.1% |
| atttttttttt | 51 | 0.1% |
| Other values (189) | 1838 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 15861 | |
| A | 13146 | |
| T | 12926 | |
| G | 11300 | |
| N | 100 | 0.2% |
| < | 1 | < 0.1% |
| D | 1 | < 0.1% |
| E | 1 | < 0.1% |
| L | 1 | < 0.1% |
| > | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 53336 | |
| Math Symbol | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 15861 | |
| A | 13146 | |
| T | 12926 | |
| G | 11300 | |
| N | 100 | 0.2% |
| D | 1 | < 0.1% |
| E | 1 | < 0.1% |
| L | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 1 | |
| > | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 53336 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 15861 | |
| A | 13146 | |
| T | 12926 | |
| G | 11300 | |
| N | 100 | 0.2% |
| D | 1 | < 0.1% |
| E | 1 | < 0.1% |
| L | 1 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| < | 1 | |
| > | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 53338 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 15861 | |
| A | 13146 | |
| T | 12926 | |
| G | 11300 | |
| N | 100 | 0.2% |
| < | 1 | < 0.1% |
| D | 1 | < 0.1% |
| E | 1 | < 0.1% |
| L | 1 | < 0.1% |
| > | 1 | < 0.1% |
AF
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17793 |
| Missing (%) | 46.0% |
| Memory size | 302.1 KiB |
| 0.0 | |
|---|---|
| 1.0 | 39 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 62598 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 20827 | |
| 1.0 | 39 | 0.1% |
| (Missing) | 17793 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 20827 | |
| 1.0 | 39 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 41693 | |
| . | 20866 | |
| 1 | 39 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 41732 | |
| Other Punctuation | 20866 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 41693 | |
| 1 | 39 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 62598 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 41693 | |
| . | 20866 | |
| 1 | 39 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62598 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 41693 | |
| . | 20866 | |
| 1 | 39 | 0.1% |
GENEINFO
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17793 |
| Missing (%) | 46.0% |
| Memory size | 302.1 KiB |
| BRCA2 | |
|---|---|
| BRCA1 | |
| BRCA2:675 | 308 |
| BRCA1:672 | 191 |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.095658 |
| Min length | 5 |
Characters and Unicode
| Total characters | 106326 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BRCA2 |
|---|---|
| 2nd row | BRCA2 |
| 3rd row | BRCA2 |
| 4th row | BRCA2 |
| 5th row | BRCA2 |
Common Values
| Value | Count | Frequency (%) |
| BRCA2 | 12293 | |
| BRCA1 | 8074 | |
| BRCA2:675 | 308 | 0.8% |
| BRCA1:672 | 191 | 0.5% |
| (Missing) | 17793 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| brca2 | 12293 | |
| brca1 | 8074 | |
| brca2:675 | 308 | 1.5% |
| brca1:672 | 191 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 20866 | |
| R | 20866 | |
| C | 20866 | |
| A | 20866 | |
| 2 | 12792 | |
| 1 | 8265 | 7.8% |
| : | 499 | 0.5% |
| 6 | 499 | 0.5% |
| 7 | 499 | 0.5% |
| 5 | 308 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 83464 | |
| Decimal Number | 22363 | 21.0% |
| Other Punctuation | 499 | 0.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 12792 | |
| 1 | 8265 | |
| 6 | 499 | 2.2% |
| 7 | 499 | 2.2% |
| 5 | 308 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 20866 | |
| R | 20866 | |
| C | 20866 | |
| A | 20866 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 499 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83464 | |
| Common | 22862 | 21.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 12792 | |
| 1 | 8265 | |
| : | 499 | 2.2% |
| 6 | 499 | 2.2% |
| 7 | 499 | 2.2% |
| 5 | 308 | 1.3% |
Latin
| Value | Count | Frequency (%) |
| B | 20866 | |
| R | 20866 | |
| C | 20866 | |
| A | 20866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 106326 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 20866 | |
| R | 20866 | |
| C | 20866 | |
| A | 20866 | |
| 2 | 12792 | |
| 1 | 8265 | 7.8% |
| : | 499 | 0.5% |
| 6 | 499 | 0.5% |
| 7 | 499 | 0.5% |
| 5 | 308 | 0.3% |
NAME
Text
| Distinct | 1722 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 302.1 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 8.907085 |
| Min length | 6 |
Characters and Unicode
| Total characters | 344339 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BRCA24/19 |
|---|---|
| 2nd row | BRCA24/19 |
| 3rd row | BRCA24/19 |
| 4th row | BRCA24/19 |
| 5th row | BRCA24/19 |
| Value | Count | Frequency (%) |
| hc68/19 | 506 | 1.3% |
| hc1/19 | 500 | 1.3% |
| brca290/21 | 486 | 1.3% |
| brca174/21 | 464 | 1.2% |
| hc10/19 | 442 | 1.1% |
| hc65/19 | 441 | 1.1% |
| hc1/22 | 441 | 1.1% |
| brca37/21 | 438 | 1.1% |
| hc100/19 | 435 | 1.1% |
| hc101/19 | 430 | 1.1% |
| Other values (1712) | 34076 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 41635 | |
| 2 | 40431 | |
| C | 38659 | |
| / | 38659 | |
| B | 26147 | |
| R | 26147 | |
| A | 26147 | |
| 9 | 22739 | |
| 0 | 18233 | 5.3% |
| H | 12512 | 3.6% |
| Other values (6) | 53030 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 176068 | |
| Uppercase Letter | 129612 | |
| Other Punctuation | 38659 | 11.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 41635 | |
| 2 | 40431 | |
| 9 | 22739 | |
| 0 | 18233 | |
| 3 | 11030 | 6.3% |
| 4 | 10277 | 5.8% |
| 6 | 9112 | 5.2% |
| 5 | 8539 | 4.8% |
| 7 | 7899 | 4.5% |
| 8 | 6173 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 38659 | |
| B | 26147 | |
| R | 26147 | |
| A | 26147 | |
| H | 12512 | 9.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 38659 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 214727 | |
| Latin | 129612 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 41635 | |
| 2 | 40431 | |
| / | 38659 | |
| 9 | 22739 | |
| 0 | 18233 | |
| 3 | 11030 | 5.1% |
| 4 | 10277 | 4.8% |
| 6 | 9112 | 4.2% |
| 5 | 8539 | 4.0% |
| 7 | 7899 | 3.7% |
Latin
| Value | Count | Frequency (%) |
| C | 38659 | |
| B | 26147 | |
| R | 26147 | |
| A | 26147 | |
| H | 12512 | 9.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 344339 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 41635 | |
| 2 | 40431 | |
| C | 38659 | |
| / | 38659 | |
| B | 26147 | |
| R | 26147 | |
| A | 26147 | |
| 9 | 22739 | |
| 0 | 18233 | 5.3% |
| H | 12512 | 3.6% |
| Other values (6) | 53030 |
TISSUE
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 302.1 KiB |
| GERMLINE | |
|---|---|
| SOMATIC | 3124 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.9191909 |
| Min length | 7 |
Characters and Unicode
| Total characters | 306148 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | GERMLINE |
|---|---|
| 2nd row | GERMLINE |
| 3rd row | GERMLINE |
| 4th row | GERMLINE |
| 5th row | GERMLINE |
Common Values
| Value | Count | Frequency (%) |
| GERMLINE | 35535 | |
| SOMATIC | 3124 | 8.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| germline | 35535 | |
| somatic | 3124 | 8.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 71070 | |
| M | 38659 | |
| I | 38659 | |
| G | 35535 | |
| R | 35535 | |
| L | 35535 | |
| N | 35535 | |
| S | 3124 | 1.0% |
| O | 3124 | 1.0% |
| A | 3124 | 1.0% |
| Other values (2) | 6248 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 306148 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 71070 | |
| M | 38659 | |
| I | 38659 | |
| G | 35535 | |
| R | 35535 | |
| L | 35535 | |
| N | 35535 | |
| S | 3124 | 1.0% |
| O | 3124 | 1.0% |
| A | 3124 | 1.0% |
| Other values (2) | 6248 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 306148 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 71070 | |
| M | 38659 | |
| I | 38659 | |
| G | 35535 | |
| R | 35535 | |
| L | 35535 | |
| N | 35535 | |
| S | 3124 | 1.0% |
| O | 3124 | 1.0% |
| A | 3124 | 1.0% |
| Other values (2) | 6248 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 306148 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 71070 | |
| M | 38659 | |
| I | 38659 | |
| G | 35535 | |
| R | 35535 | |
| L | 35535 | |
| N | 35535 | |
| S | 3124 | 1.0% |
| O | 3124 | 1.0% |
| A | 3124 | 1.0% |
| Other values (2) | 6248 | 2.0% |
CTYPE
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 302.1 KiB |
| BRCA | |
|---|---|
| HC |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.3526992 |
| Min length | 2 |
Characters and Unicode
| Total characters | 129612 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BRCA |
|---|---|
| 2nd row | BRCA |
| 3rd row | BRCA |
| 4th row | BRCA |
| 5th row | BRCA |
Common Values
| Value | Count | Frequency (%) |
| BRCA | 26147 | |
| HC | 12512 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| brca | 26147 | |
| hc | 12512 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 38659 | |
| B | 26147 | |
| R | 26147 | |
| A | 26147 | |
| H | 12512 | 9.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 129612 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 38659 | |
| B | 26147 | |
| R | 26147 | |
| A | 26147 | |
| H | 12512 | 9.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 129612 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 38659 | |
| B | 26147 | |
| R | 26147 | |
| A | 26147 | |
| H | 12512 | 9.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 129612 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 38659 | |
| B | 26147 | |
| R | 26147 | |
| A | 26147 | |
| H | 12512 | 9.7% |
GT
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 302.1 KiB |
| 0/1 | |
|---|---|
| 1/1 | |
| 0/0 | 2378 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 115977 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0/1 |
|---|---|
| 2nd row | 0/1 |
| 3rd row | 1/1 |
| 4th row | 1/1 |
| 5th row | 1/1 |
Common Values
| Value | Count | Frequency (%) |
| 0/1 | 24055 | |
| 1/1 | 12226 | |
| 0/0 | 2378 | 6.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0/1 | 24055 | |
| 1/1 | 12226 | |
| 0/0 | 2378 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 48507 | |
| / | 38659 | |
| 0 | 28811 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 77318 | |
| Other Punctuation | 38659 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 48507 | |
| 0 | 28811 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 38659 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 115977 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 48507 | |
| / | 38659 | |
| 0 | 28811 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 115977 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 48507 | |
| / | 38659 | |
| 0 | 28811 |
1000gp3_eur_af
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 55 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 30399 |
| Missing (%) | 78.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.44026266 |
| Minimum | 0 |
|---|---|
| Maximum | 0.99900596 |
| Zeros | 92 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.024850895 |
| Q1 | 0.29522863 |
| median | 0.35685885 |
| Q3 | 0.36282306 |
| 95-th percentile | 0.99900596 |
| Maximum | 0.99900596 |
| Range | 0.99900596 |
| Interquartile range (IQR) | 0.067594433 |
Descriptive statistics
| Standard deviation | 0.31685814 |
|---|---|
| Coefficient of variation (CV) | 0.71970251 |
| Kurtosis | -0.5032036 |
| Mean | 0.44026266 |
| Median Absolute Deviation (MAD) | 0.061630219 |
| Skewness | 0.79913224 |
| Sum | 3636.5696 |
| Variance | 0.10039908 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.9990059642 | 1720 | 4.4% |
| 0.3628230616 | 991 | 2.6% |
| 0.3598409543 | 962 | 2.5% |
| 0.3548707753 | 955 | 2.5% |
| 0.3568588469 | 953 | 2.5% |
| 0.2952286282 | 827 | 2.1% |
| 0.08349900596 | 270 | 0.7% |
| 0.05964214712 | 257 | 0.7% |
| 0.03479125249 | 255 | 0.7% |
| 0.02882703777 | 113 | 0.3% |
| Other values (45) | 957 | 2.5% |
| (Missing) | 30399 |
| Value | Count | Frequency (%) |
| 0 | 92 | |
| 0.0009940357853 | 64 | |
| 0.001988071571 | 15 | < 0.1% |
| 0.002982107356 | 60 | |
| 0.003976143141 | 3 | < 0.1% |
| 0.004970178926 | 11 | < 0.1% |
| 0.005964214712 | 3 | < 0.1% |
| 0.006958250497 | 6 | < 0.1% |
| 0.007952286282 | 28 | 0.1% |
| 0.008946322068 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.9990059642 | 1720 | |
| 0.8767395626 | 45 | 0.1% |
| 0.7654075547 | 45 | 0.1% |
| 0.7147117296 | 43 | 0.1% |
| 0.6411530815 | 41 | 0.1% |
| 0.5616302187 | 40 | 0.1% |
| 0.4691848907 | 33 | 0.1% |
| 0.4373757455 | 31 | 0.1% |
| 0.3946322068 | 30 | 0.1% |
| 0.3787276342 | 24 | 0.1% |
clinpred_pred
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 29208 |
| Missing (%) | 75.6% |
| Memory size | 302.1 KiB |
| T | |
|---|---|
| D | 475 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 9451 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | T |
|---|---|
| 2nd row | T |
| 3rd row | T |
| 4th row | T |
| 5th row | T |
Common Values
| Value | Count | Frequency (%) |
| T | 8976 | 23.2% |
| D | 475 | 1.2% |
| (Missing) | 29208 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| t | 8976 | |
| d | 475 | 5.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 8976 | |
| D | 475 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9451 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 8976 | |
| D | 475 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9451 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 8976 | |
| D | 475 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9451 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 8976 | |
| D | 475 | 5.0% |
clinpred_rankscore
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1007 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 29208 |
| Missing (%) | 75.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.043587135 |
| Minimum | 0 |
|---|---|
| Maximum | 0.95599 |
| Zeros | 955 |
| Zeros (%) | 2.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.00024 |
| median | 0.00078 |
| Q3 | 0.02075 |
| 95-th percentile | 0.33286 |
| Maximum | 0.95599 |
| Range | 0.95599 |
| Interquartile range (IQR) | 0.02051 |
Descriptive statistics
| Standard deviation | 0.13955438 |
|---|---|
| Coefficient of variation (CV) | 3.2017333 |
| Kurtosis | 16.493927 |
| Mean | 0.043587135 |
| Median Absolute Deviation (MAD) | 0.00064 |
| Skewness | 4.0443231 |
| Sum | 411.94201 |
| Variance | 0.019475425 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.00085 | 1721 | 4.5% |
| 0.00026 | 991 | 2.6% |
| 0.00024 | 962 | 2.5% |
| 0 | 955 | 2.5% |
| 0.02075 | 953 | 2.5% |
| 0.00012 | 827 | 2.1% |
| 0.00059 | 383 | 1.0% |
| 0.02536 | 257 | 0.7% |
| 9 × 10-5 | 150 | 0.4% |
| 0.00142 | 127 | 0.3% |
| Other values (997) | 2125 | 5.5% |
| (Missing) | 29208 |
| Value | Count | Frequency (%) |
| 0 | 955 | |
| 1 × 10-5 | 31 | 0.1% |
| 2 × 10-5 | 6 | < 0.1% |
| 4 × 10-5 | 5 | < 0.1% |
| 5 × 10-5 | 26 | 0.1% |
| 9 × 10-5 | 150 | 0.4% |
| 0.0001 | 8 | < 0.1% |
| 0.00012 | 827 | |
| 0.00014 | 13 | < 0.1% |
| 0.00016 | 45 | 0.1% |
| Value | Count | Frequency (%) |
| 0.95599 | 7 | |
| 0.95503 | 1 | < 0.1% |
| 0.94592 | 1 | < 0.1% |
| 0.92979 | 1 | < 0.1% |
| 0.92389 | 1 | < 0.1% |
| 0.91716 | 1 | < 0.1% |
| 0.90962 | 1 | < 0.1% |
| 0.90576 | 4 | |
| 0.90554 | 1 | < 0.1% |
| 0.90116 | 1 | < 0.1% |
clinvar_id
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 725 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 29549 |
| Missing (%) | 76.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88995.646 |
| Minimum | 829 |
|---|---|
| Maximum | 1332623 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 829 |
|---|---|
| 5-th percentile | 9329 |
| Q1 | 41808 |
| median | 41818 |
| Q3 | 133738 |
| 95-th percentile | 183700 |
| Maximum | 1332623 |
| Range | 1331794 |
| Interquartile range (IQR) | 91930 |
Descriptive statistics
| Standard deviation | 144642.29 |
|---|---|
| Coefficient of variation (CV) | 1.6252738 |
| Kurtosis | 26.778679 |
| Mean | 88995.646 |
| Median Absolute Deviation (MAD) | 251 |
| Skewness | 4.8594024 |
| Sum | 8.1075033 × 108 |
| Variance | 2.0921393 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 133738 | 1720 | 4.4% |
| 41812 | 991 | 2.6% |
| 41827 | 962 | 2.5% |
| 41818 | 955 | 2.5% |
| 41815 | 953 | 2.5% |
| 9329 | 827 | 2.1% |
| 41808 | 270 | 0.7% |
| 41803 | 257 | 0.7% |
| 41545 | 128 | 0.3% |
| 41567 | 127 | 0.3% |
| Other values (715) | 1920 | 5.0% |
| (Missing) | 29549 |
| Value | Count | Frequency (%) |
| 829 | 2 | < 0.1% |
| 1762 | 3 | < 0.1% |
| 3048 | 1 | < 0.1% |
| 5294 | 1 | < 0.1% |
| 8045 | 3 | < 0.1% |
| 9329 | 827 | |
| 9347 | 1 | < 0.1% |
| 12351 | 43 | 0.1% |
| 17661 | 2 | < 0.1% |
| 17670 | 113 | 0.3% |
| Value | Count | Frequency (%) |
| 1332623 | 1 | < 0.1% |
| 1319574 | 2 | < 0.1% |
| 1319570 | 2 | < 0.1% |
| 1312623 | 1 | < 0.1% |
| 1309096 | 1 | < 0.1% |
| 1166234 | 24 | |
| 1131690 | 1 | < 0.1% |
| 1064262 | 1 | < 0.1% |
| 1059432 | 1 | < 0.1% |
| 1056420 | 1 | < 0.1% |
domains_count
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18486 |
| Missing (%) | 47.8% |
| Memory size | 302.1 KiB |
| 2.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 60519 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 20173 | |
| (Missing) | 18486 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2.0 | 20173 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 20173 | |
| . | 20173 | |
| 0 | 20173 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40346 | |
| Other Punctuation | 20173 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 20173 | |
| 0 | 20173 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20173 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 60519 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 20173 | |
| . | 20173 | |
| 0 | 20173 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 60519 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 20173 | |
| . | 20173 | |
| 0 | 20173 |
gnomad_exomes_non_cancer_nfe_af
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 272 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 30037 |
| Missing (%) | 77.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.40685234 |
| Minimum | 0 |
|---|---|
| Maximum | 0.999707 |
| Zeros | 113 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.000489793 |
| Q1 | 0.278914 |
| median | 0.325862 |
| Q3 | 0.334428 |
| 95-th percentile | 0.999707 |
| Maximum | 0.999707 |
| Range | 0.999707 |
| Interquartile range (IQR) | 0.055514 |
Descriptive statistics
| Standard deviation | 0.32595217 |
|---|---|
| Coefficient of variation (CV) | 0.80115594 |
| Kurtosis | -0.40957203 |
| Mean | 0.40685234 |
| Median Absolute Deviation (MAD) | 0.046948 |
| Skewness | 0.89932934 |
| Sum | 3507.8809 |
| Variance | 0.10624482 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.999707 | 1720 | 4.4% |
| 0.334428 | 991 | 2.6% |
| 0.326518 | 962 | 2.5% |
| 0.324908 | 955 | 2.5% |
| 0.325862 | 953 | 2.5% |
| 0.278914 | 827 | 2.1% |
| 0.0777997 | 270 | 0.7% |
| 0.0645935 | 257 | 0.7% |
| 0.0348801 | 128 | 0.3% |
| 0.034771 | 127 | 0.3% |
| Other values (262) | 1432 | 3.7% |
| (Missing) | 30037 |
| Value | Count | Frequency (%) |
| 0 | 113 | |
| 9.73312 × 10-6 | 1 | < 0.1% |
| 9.73539 × 10-6 | 2 | < 0.1% |
| 9.73634 × 10-6 | 1 | < 0.1% |
| 9.73672 × 10-6 | 1 | < 0.1% |
| 9.73691 × 10-6 | 2 | < 0.1% |
| 9.73786 × 10-6 | 1 | < 0.1% |
| 9.73881 × 10-6 | 1 | < 0.1% |
| 9.73975 × 10-6 | 3 | < 0.1% |
| 9.74051 × 10-6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.999707 | 1720 | |
| 0.850549 | 45 | 0.1% |
| 0.770919 | 45 | 0.1% |
| 0.738026 | 43 | 0.1% |
| 0.617268 | 41 | 0.1% |
| 0.587903 | 40 | 0.1% |
| 0.435408 | 33 | 0.1% |
| 0.416586 | 31 | 0.1% |
| 0.403692 | 24 | 0.1% |
| 0.377731 | 30 | 0.1% |
mutationassessor_rankscore
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 266 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 33020 |
| Missing (%) | 85.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.32885815 |
| Minimum | 4 × 10-5 |
|---|---|
| Maximum | 0.98483 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 4 × 10-5 |
|---|---|
| 5-th percentile | 4 × 10-5 |
| Q1 | 0.02676 |
| median | 0.33814 |
| Q3 | 0.64647 |
| 95-th percentile | 0.90961 |
| Maximum | 0.98483 |
| Range | 0.98479 |
| Interquartile range (IQR) | 0.61971 |
Descriptive statistics
| Standard deviation | 0.30098866 |
|---|---|
| Coefficient of variation (CV) | 0.91525378 |
| Kurtosis | -1.3213004 |
| Mean | 0.32885815 |
| Median Absolute Deviation (MAD) | 0.30833 |
| Skewness | 0.2783291 |
| Sum | 1854.4311 |
| Variance | 0.090594173 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 × 10-5 | 991 | 2.6% |
| 0.48678 | 963 | 2.5% |
| 0.02676 | 955 | 2.5% |
| 0.64647 | 953 | 2.5% |
| 0.11182 | 279 | 0.7% |
| 0.92174 | 257 | 0.7% |
| 0.55503 | 113 | 0.3% |
| 0.25572 | 63 | 0.2% |
| 0.09039 | 56 | 0.1% |
| 0.01383 | 45 | 0.1% |
| Other values (256) | 964 | 2.5% |
| (Missing) | 33020 |
| Value | Count | Frequency (%) |
| 4 × 10-5 | 991 | |
| 0.00015 | 2 | < 0.1% |
| 0.00021 | 1 | < 0.1% |
| 0.00063 | 2 | < 0.1% |
| 0.00086 | 1 | < 0.1% |
| 0.00254 | 45 | 0.1% |
| 0.00541 | 1 | < 0.1% |
| 0.00573 | 13 | < 0.1% |
| 0.00597 | 36 | 0.1% |
| 0.00812 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| 0.98483 | 2 | |
| 0.98424 | 3 | |
| 0.97262 | 1 | < 0.1% |
| 0.96783 | 1 | < 0.1% |
| 0.95518 | 1 | < 0.1% |
| 0.95291 | 1 | < 0.1% |
| 0.94976 | 1 | < 0.1% |
| 0.94936 | 1 | < 0.1% |
| 0.94485 | 3 | |
| 0.94442 | 1 | < 0.1% |
mutationtaster_converted_rankscore
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 270 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 28988 |
| Missing (%) | 75.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.20540653 |
| Minimum | 0.08975 |
|---|---|
| Maximum | 0.81001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 0.08975 |
|---|---|
| 5-th percentile | 0.08975 |
| Q1 | 0.08975 |
| median | 0.08975 |
| Q3 | 0.22811 |
| 95-th percentile | 0.81001 |
| Maximum | 0.81001 |
| Range | 0.72026 |
| Interquartile range (IQR) | 0.13836 |
Descriptive statistics
| Standard deviation | 0.2179582 |
|---|---|
| Coefficient of variation (CV) | 1.0611065 |
| Kurtosis | 1.3831495 |
| Mean | 0.20540653 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.6917373 |
| Sum | 1986.4866 |
| Variance | 0.047505775 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.08975 | 7042 | 18.2% |
| 0.58761 | 1086 | 2.8% |
| 0.81001 | 526 | 1.4% |
| 0.25265 | 257 | 0.7% |
| 0.22811 | 113 | 0.3% |
| 0.25075 | 45 | 0.1% |
| 0.27532 | 45 | 0.1% |
| 0.18612 | 31 | 0.1% |
| 0.20638 | 25 | 0.1% |
| 0.23243 | 20 | 0.1% |
| Other values (260) | 481 | 1.2% |
| (Missing) | 28988 |
| Value | Count | Frequency (%) |
| 0.08975 | 7042 | |
| 0.18198 | 10 | < 0.1% |
| 0.18612 | 31 | 0.1% |
| 0.18878 | 7 | < 0.1% |
| 0.19072 | 11 | < 0.1% |
| 0.19238 | 1 | < 0.1% |
| 0.19486 | 1 | < 0.1% |
| 0.19599 | 1 | < 0.1% |
| 0.19853 | 1 | < 0.1% |
| 0.19925 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.81001 | 526 | |
| 0.58761 | 1086 | |
| 0.54805 | 2 | < 0.1% |
| 0.53665 | 3 | < 0.1% |
| 0.52935 | 14 | < 0.1% |
| 0.52396 | 6 | < 0.1% |
| 0.51968 | 14 | < 0.1% |
| 0.51612 | 3 | < 0.1% |
| 0.51308 | 2 | < 0.1% |
| 0.51042 | 1 | < 0.1% |
polyphen2_hdiv_rankscore
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 213 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 32981 |
| Missing (%) | 85.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.26620307 |
| Minimum | 0.02946 |
|---|---|
| Maximum | 0.90584 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 0.02946 |
|---|---|
| 5-th percentile | 0.02946 |
| Q1 | 0.02946 |
| median | 0.28547 |
| Q3 | 0.52359 |
| 95-th percentile | 0.7322 |
| Maximum | 0.90584 |
| Range | 0.87638 |
| Interquartile range (IQR) | 0.49413 |
Descriptive statistics
| Standard deviation | 0.24119701 |
|---|---|
| Coefficient of variation (CV) | 0.90606397 |
| Kurtosis | -0.82086029 |
| Mean | 0.26620307 |
| Median Absolute Deviation (MAD) | 0.23812 |
| Skewness | 0.57671055 |
| Sum | 1511.5011 |
| Variance | 0.058176 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.02946 | 2146 | 5.6% |
| 0.31319 | 973 | 2.5% |
| 0.52359 | 953 | 2.5% |
| 0.07471 | 315 | 0.8% |
| 0.7322 | 271 | 0.7% |
| 0.57829 | 119 | 0.3% |
| 0.90584 | 74 | 0.2% |
| 0.28547 | 56 | 0.1% |
| 0.2013 | 43 | 0.1% |
| 0.43117 | 43 | 0.1% |
| Other values (203) | 685 | 1.8% |
| (Missing) | 32981 |
| Value | Count | Frequency (%) |
| 0.02946 | 2146 | |
| 0.07471 | 315 | 0.8% |
| 0.09854 | 37 | 0.1% |
| 0.11197 | 4 | < 0.1% |
| 0.12183 | 2 | < 0.1% |
| 0.12996 | 3 | < 0.1% |
| 0.13644 | 33 | 0.1% |
| 0.14184 | 2 | < 0.1% |
| 0.14655 | 15 | < 0.1% |
| 0.15093 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.90584 | 74 | 0.2% |
| 0.77913 | 30 | 0.1% |
| 0.7322 | 271 | |
| 0.70673 | 3 | < 0.1% |
| 0.68779 | 18 | < 0.1% |
| 0.67487 | 4 | < 0.1% |
| 0.66517 | 2 | < 0.1% |
| 0.65571 | 4 | < 0.1% |
| 0.6407 | 1 | < 0.1% |
| 0.63424 | 3 | < 0.1% |
sift_converted_rankscore
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 352 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 29211 |
| Missing (%) | 75.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.24909625 |
| Minimum | 0.00964 |
|---|---|
| Maximum | 0.91255 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 0.00964 |
|---|---|
| 5-th percentile | 0.00964 |
| Q1 | 0.00964 |
| median | 0.24955 |
| Q3 | 0.44694 |
| 95-th percentile | 0.72154 |
| Maximum | 0.91255 |
| Range | 0.90291 |
| Interquartile range (IQR) | 0.4373 |
Descriptive statistics
| Standard deviation | 0.24998814 |
|---|---|
| Coefficient of variation (CV) | 1.0035805 |
| Kurtosis | -0.62151011 |
| Mean | 0.24909625 |
| Median Absolute Deviation (MAD) | 0.23991 |
| Skewness | 0.62751686 |
| Sum | 2353.4613 |
| Variance | 0.062494069 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.00964 | 3943 | 10.2% |
| 0.44694 | 971 | 2.5% |
| 0.46129 | 958 | 2.5% |
| 0.25768 | 828 | 2.1% |
| 0.63226 | 283 | 0.7% |
| 0.5553 | 273 | 0.7% |
| 0.91255 | 204 | 0.5% |
| 0.72154 | 184 | 0.5% |
| 0.35349 | 119 | 0.3% |
| 0.7849 | 115 | 0.3% |
| Other values (342) | 1570 | 4.1% |
| (Missing) | 29211 |
| Value | Count | Frequency (%) |
| 0.00964 | 3943 | |
| 0.02084 | 1 | < 0.1% |
| 0.02176 | 1 | < 0.1% |
| 0.02228 | 1 | < 0.1% |
| 0.02239 | 1 | < 0.1% |
| 0.02292 | 26 | 0.1% |
| 0.02407 | 1 | < 0.1% |
| 0.02782 | 1 | < 0.1% |
| 0.02803 | 31 | 0.1% |
| 0.02832 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.91255 | 204 | |
| 0.7849 | 115 | |
| 0.72154 | 184 | |
| 0.68238 | 51 | 0.1% |
| 0.65419 | 25 | 0.1% |
| 0.63226 | 283 | |
| 0.61437 | 11 | < 0.1% |
| 0.59928 | 20 | 0.1% |
| 0.58626 | 9 | < 0.1% |
| 0.5748 | 5 | < 0.1% |
strand
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1973 |
| Missing (%) | 5.1% |
| Memory size | 302.1 KiB |
| 1.0 | |
|---|---|
| -1.0 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.4231587 |
| Min length | 3 |
Characters and Unicode
| Total characters | 125582 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 21162 | |
| -1.0 | 15524 | |
| (Missing) | 1973 | 5.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 36686 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 36686 | |
| . | 36686 | |
| 0 | 36686 | |
| - | 15524 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 73372 | |
| Other Punctuation | 36686 | |
| Dash Punctuation | 15524 | 12.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 36686 | |
| 0 | 36686 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 36686 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15524 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 125582 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 36686 | |
| . | 36686 | |
| 0 | 36686 | |
| - | 15524 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 125582 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 36686 | |
| . | 36686 | |
| 0 | 36686 | |
| - | 15524 |
sift_score
Text
MISSING 
| Distinct | 93 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 28997 |
| Missing (%) | 75.0% |
| Memory size | 302.1 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 4 |
| Mean length | 2.6549369 |
| Min length | 1 |
Characters and Unicode
| Total characters | 25652 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.04 |
| 3rd row | 1.0 |
| 4th row | 0.08 |
| 5th row | 1.0 |
| Value | Count | Frequency (%) |
| 1 | 4082 | |
| 0.08 | 996 | 10.3% |
| 0.09 | 987 | 10.2% |
| 0.04 | 898 | 9.3% |
| 0 | 377 | 3.9% |
| 0.01 | 377 | 3.9% |
| 0.05 | 338 | 3.5% |
| 214 | 2.2% | |
| 0.03 | 177 | 1.8% |
| 0.16 | 129 | 1.3% |
| Other values (75) | 1087 | 11.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9270 | |
| . | 5782 | |
| 1 | 5117 | |
| 4 | 1168 | 4.6% |
| 9 | 1076 | 4.2% |
| 8 | 1046 | 4.1% |
| , | 579 | 2.3% |
| 5 | 487 | 1.9% |
| 3 | 416 | 1.6% |
| 2 | 352 | 1.4% |
| Other values (2) | 359 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19291 | |
| Other Punctuation | 6361 | 24.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9270 | |
| 1 | 5117 | |
| 4 | 1168 | 6.1% |
| 9 | 1076 | 5.6% |
| 8 | 1046 | 5.4% |
| 5 | 487 | 2.5% |
| 3 | 416 | 2.2% |
| 2 | 352 | 1.8% |
| 6 | 256 | 1.3% |
| 7 | 103 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5782 | |
| , | 579 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 25652 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9270 | |
| . | 5782 | |
| 1 | 5117 | |
| 4 | 1168 | 4.6% |
| 9 | 1076 | 4.2% |
| 8 | 1046 | 4.1% |
| , | 579 | 2.3% |
| 5 | 487 | 1.9% |
| 3 | 416 | 1.6% |
| 2 | 352 | 1.4% |
| Other values (2) | 359 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25652 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9270 | |
| . | 5782 | |
| 1 | 5117 | |
| 4 | 1168 | 4.6% |
| 9 | 1076 | 4.2% |
| 8 | 1046 | 4.1% |
| , | 579 | 2.3% |
| 5 | 487 | 1.9% |
| 3 | 416 | 1.6% |
| 2 | 352 | 1.4% |
| Other values (2) | 359 | 1.4% |
hgvsc
Text
MISSING 
| Distinct | 3315 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 2144 |
| Missing (%) | 5.5% |
| Memory size | 302.1 KiB |
Length
| Max length | 89 |
|---|---|
| Median length | 27 |
| Mean length | 28.903957 |
| Min length | 24 |
Characters and Unicode
| Total characters | 1055428 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1902 ? |
|---|---|
| Unique (%) | 5.2% |
Sample
| 1st row | ENST00000380152.8:c.-26G>A |
|---|---|
| 2nd row | ENST00000380152.8:c.3807T>C |
| 3rd row | ENST00000380152.8:c.4563A>G |
| 4th row | ENST00000380152.8:c.6513G>C |
| 5th row | ENST00000380152.8:c.7397T>C |
| Value | Count | Frequency (%) |
| enst00000380152.8:c.4563a>g | 1722 | 4.7% |
| enst00000380152.8:c.6513g>c | 1721 | 4.7% |
| enst00000380152.8:c.7397t>c | 1720 | 4.7% |
| enst00000380152.8:c.7806-14t>c | 1239 | 3.4% |
| enst00000357654.9:c.2612c>t | 991 | 2.7% |
| enst00000357654.9:c.4837a>g | 962 | 2.6% |
| enst00000357654.9:c.4308t>c | 958 | 2.6% |
| enst00000357654.9:c.2082c>t | 955 | 2.6% |
| enst00000357654.9:c.3548a>g | 955 | 2.6% |
| enst00000357654.9:c.2311t>c | 953 | 2.6% |
| Other values (3305) | 24339 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 215877 | |
| . | 73030 | 6.9% |
| 3 | 61357 | 5.8% |
| 5 | 54032 | 5.1% |
| 1 | 52529 | 5.0% |
| T | 52121 | 4.9% |
| 8 | 49104 | 4.7% |
| 2 | 48400 | 4.6% |
| 4 | 39892 | 3.8% |
| 6 | 38095 | 3.6% |
| Other values (23) | 370991 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 627821 | |
| Uppercase Letter | 215808 | 20.4% |
| Other Punctuation | 110167 | 10.4% |
| Lowercase Letter | 47309 | 4.5% |
| Math Symbol | 40950 | 3.9% |
| Dash Punctuation | 10635 | 1.0% |
| Connector Punctuation | 2738 | 0.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 215877 | |
| 3 | 61357 | 9.8% |
| 5 | 54032 | 8.6% |
| 1 | 52529 | 8.4% |
| 8 | 49104 | 7.8% |
| 2 | 48400 | 7.7% |
| 4 | 39892 | 6.4% |
| 6 | 38095 | 6.1% |
| 7 | 35814 | 5.7% |
| 9 | 32721 | 5.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 36515 | |
| d | 2757 | 5.8% |
| e | 2633 | 5.6% |
| l | 2633 | 5.6% |
| i | 841 | 1.8% |
| n | 841 | 1.8% |
| s | 841 | 1.8% |
| u | 124 | 0.3% |
| p | 124 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 52121 | |
| E | 36515 | |
| N | 36515 | |
| S | 36515 | |
| G | 18492 | 8.6% |
| C | 18134 | 8.4% |
| A | 17516 | 8.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 73030 | |
| : | 36515 | |
| * | 622 | 0.6% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 32917 | |
| + | 8033 | 19.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10635 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2738 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 792311 | |
| Latin | 263117 | 24.9% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 215877 | |
| . | 73030 | 9.2% |
| 3 | 61357 | 7.7% |
| 5 | 54032 | 6.8% |
| 1 | 52529 | 6.6% |
| 8 | 49104 | 6.2% |
| 2 | 48400 | 6.1% |
| 4 | 39892 | 5.0% |
| 6 | 38095 | 4.8% |
| : | 36515 | 4.6% |
| Other values (7) | 123480 |
Latin
| Value | Count | Frequency (%) |
| T | 52121 | |
| E | 36515 | |
| N | 36515 | |
| S | 36515 | |
| c | 36515 | |
| G | 18492 | 7.0% |
| C | 18134 | 6.9% |
| A | 17516 | 6.7% |
| d | 2757 | 1.0% |
| e | 2633 | 1.0% |
| Other values (6) | 5404 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1055428 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 215877 | |
| . | 73030 | 6.9% |
| 3 | 61357 | 5.8% |
| 5 | 54032 | 5.1% |
| 1 | 52529 | 5.0% |
| T | 52121 | 4.9% |
| 8 | 49104 | 4.7% |
| 2 | 48400 | 4.6% |
| 4 | 39892 | 3.8% |
| 6 | 38095 | 3.6% |
| Other values (23) | 370991 |
clin_sig_allele
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8565 |
| Missing (%) | 22.2% |
| Memory size | 302.1 KiB |
| NEG | |
|---|---|
| VUS | 744 |
| POS | 209 |
| A:risk_factor;A:benign | 22 |
| G:risk_factor;G:benign;G:benign/likely_benign;G:likely_benign | 3 |
Length
| Max length | 95 |
|---|---|
| Median length | 3 |
| Mean length | 3.0227288 |
| Min length | 3 |
Characters and Unicode
| Total characters | 90966 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NEG |
|---|---|
| 2nd row | NEG |
| 3rd row | NEG |
| 4th row | NEG |
| 5th row | NEG |
Common Values
| Value | Count | Frequency (%) |
| NEG | 29115 | |
| VUS | 744 | 1.9% |
| POS | 209 | 0.5% |
| A:risk_factor;A:benign | 22 | 0.1% |
| G:risk_factor;G:benign;G:benign/likely_benign;G:likely_benign | 3 | < 0.1% |
| T:uncertain_significance;G:risk_factor;G:benign/likely_benign;G:uncertain_significance;G:benign | 1 | < 0.1% |
| (Missing) | 8565 | 22.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| neg | 29115 | |
| vus | 744 | 2.5% |
| pos | 209 | 0.7% |
| a:risk_factor;a:benign | 22 | 0.1% |
| g:risk_factor;g:benign;g:benign/likely_benign;g:likely_benign | 3 | < 0.1% |
| t:uncertain_significance;g:risk_factor;g:benign/likely_benign;g:uncertain_significance;g:benign | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 29131 | |
| N | 29115 | |
| E | 29115 | |
| S | 953 | 1.0% |
| V | 744 | 0.8% |
| U | 744 | 0.8% |
| P | 209 | 0.2% |
| O | 209 | 0.2% |
| n | 82 | 0.1% |
| i | 78 | 0.1% |
| Other values (20) | 586 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 90265 | |
| Lowercase Letter | 566 | 0.6% |
| Other Punctuation | 100 | 0.1% |
| Connector Punctuation | 35 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 82 | |
| i | 78 | |
| r | 54 | |
| e | 48 | |
| g | 39 | 6.9% |
| b | 37 | 6.5% |
| k | 33 | 5.8% |
| c | 32 | 5.7% |
| a | 30 | 5.3% |
| f | 28 | 4.9% |
| Other values (6) | 105 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 29131 | |
| N | 29115 | |
| E | 29115 | |
| S | 953 | 1.1% |
| V | 744 | 0.8% |
| U | 744 | 0.8% |
| P | 209 | 0.2% |
| O | 209 | 0.2% |
| A | 44 | < 0.1% |
| T | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 61 | |
| ; | 35 | |
| / | 4 | 4.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 35 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 90831 | |
| Common | 135 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 29131 | |
| N | 29115 | |
| E | 29115 | |
| S | 953 | 1.0% |
| V | 744 | 0.8% |
| U | 744 | 0.8% |
| P | 209 | 0.2% |
| O | 209 | 0.2% |
| n | 82 | 0.1% |
| i | 78 | 0.1% |
| Other values (16) | 451 | 0.5% |
Common
| Value | Count | Frequency (%) |
| : | 61 | |
| _ | 35 | |
| ; | 35 | |
| / | 4 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 90966 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 29131 | |
| N | 29115 | |
| E | 29115 | |
| S | 953 | 1.0% |
| V | 744 | 0.8% |
| U | 744 | 0.8% |
| P | 209 | 0.2% |
| O | 209 | 0.2% |
| n | 82 | 0.1% |
| i | 78 | 0.1% |
| Other values (20) | 586 | 0.6% |
pubmed_count
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 58 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 28072 |
| Missing (%) | 72.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.084727 |
| Minimum | 1 |
|---|---|
| Maximum | 114 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 28 |
| median | 34 |
| Q3 | 88 |
| 95-th percentile | 114 |
| Maximum | 114 |
| Range | 113 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 39.864674 |
|---|---|
| Coefficient of variation (CV) | 0.76538127 |
| Kurtosis | -1.3048763 |
| Mean | 52.084727 |
| Median Absolute Deviation (MAD) | 32 |
| Skewness | 0.35859194 |
| Sum | 551421 |
| Variance | 1589.1922 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 114 | 2007 | 5.2% |
| 76 | 1002 | 2.6% |
| 34 | 1000 | 2.6% |
| 28 | 960 | 2.5% |
| 88 | 953 | 2.5% |
| 33 | 953 | 2.5% |
| 1 | 720 | 1.9% |
| 2 | 372 | 1.0% |
| 3 | 360 | 0.9% |
| 41 | 274 | 0.7% |
| Other values (48) | 1986 | 5.1% |
| (Missing) | 28072 |
| Value | Count | Frequency (%) |
| 1 | 720 | |
| 2 | 372 | |
| 3 | 360 | |
| 4 | 99 | 0.3% |
| 5 | 152 | 0.4% |
| 6 | 143 | 0.4% |
| 7 | 98 | 0.3% |
| 8 | 47 | 0.1% |
| 9 | 43 | 0.1% |
| 10 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 114 | 2007 | |
| 98 | 4 | < 0.1% |
| 88 | 953 | |
| 87 | 2 | < 0.1% |
| 85 | 22 | 0.1% |
| 82 | 1 | < 0.1% |
| 80 | 257 | 0.7% |
| 76 | 1002 | |
| 60 | 3 | < 0.1% |
| 59 | 8 | < 0.1% |
frequencies
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 38659 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 302.1 KiB |
frequencies_af
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 597 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 9481 |
| Missing (%) | 24.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.47556009 |
| Minimum | 0.0002 |
|---|---|
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 0.0002 |
|---|---|
| 5-th percentile | 0.0218 |
| Q1 | 0.2494 |
| median | 0.3526 |
| Q3 | 0.7188 |
| 95-th percentile | 0.9758 |
| Maximum | 1 |
| Range | 0.9998 |
| Interquartile range (IQR) | 0.4694 |
Descriptive statistics
| Standard deviation | 0.31156372 |
|---|---|
| Coefficient of variation (CV) | 0.65515111 |
| Kurtosis | -1.0056511 |
| Mean | 0.47556009 |
| Median Absolute Deviation (MAD) | 0.1823 |
| Skewness | 0.46408428 |
| Sum | 13875.892 |
| Variance | 0.097071953 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.974 | 1805 | 4.7% |
| 0.9736 | 1721 | 4.5% |
| 0.9758 | 1720 | 4.4% |
| 0.5315 | 1239 | 3.2% |
| 0.3526 | 1013 | 2.6% |
| 0.5439 | 991 | 2.6% |
| 0.3365 | 984 | 2.5% |
| 0.3558 | 962 | 2.5% |
| 0.3363 | 958 | 2.5% |
| 0.3353 | 953 | 2.5% |
| Other values (587) | 16832 | |
| (Missing) | 9481 |
| Value | Count | Frequency (%) |
| 0.0002 | 137 | |
| 0.0004 | 73 | |
| 0.0006 | 64 | |
| 0.0008 | 30 | 0.1% |
| 0.001 | 24 | 0.1% |
| 0.0012 | 37 | 0.1% |
| 0.0014 | 14 | < 0.1% |
| 0.0016 | 50 | 0.1% |
| 0.0018 | 4 | < 0.1% |
| 0.002 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 105 | |
| 0.9996 | 1 | < 0.1% |
| 0.9992 | 94 | |
| 0.999 | 34 | 0.1% |
| 0.9986 | 87 | |
| 0.997 | 47 | |
| 0.9944 | 47 | |
| 0.9844 | 33 | 0.1% |
| 0.9824 | 43 | |
| 0.9768 | 33 | 0.1% |
frequencies_gnomadg_nfe
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 892 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 9481 |
| Missing (%) | 24.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.46965932 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 46 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.0343695 |
| Q1 | 0.2764 |
| median | 0.3321 |
| Q3 | 0.663 |
| 95-th percentile | 0.9997 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.3866 |
Descriptive statistics
| Standard deviation | 0.31598939 |
|---|---|
| Coefficient of variation (CV) | 0.67280553 |
| Kurtosis | -0.85395747 |
| Mean | 0.46965932 |
| Median Absolute Deviation (MAD) | 0.1461 |
| Skewness | 0.62176552 |
| Sum | 13703.72 |
| Variance | 0.099849296 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.9997 | 5417 | 14.0% |
| 0.521 | 1239 | 3.2% |
| 0.3301 | 1013 | 2.6% |
| 0.3317 | 1006 | 2.6% |
| 0.3407 | 991 | 2.6% |
| 0.3313 | 982 | 2.5% |
| 0.3322 | 962 | 2.5% |
| 0.3302 | 955 | 2.5% |
| 0.3295 | 953 | 2.5% |
| 0.3165 | 880 | 2.3% |
| Other values (882) | 14780 | |
| (Missing) | 9481 |
| Value | Count | Frequency (%) |
| 0 | 46 | |
| 1.47 × 10-5 | 13 | < 0.1% |
| 1.471 × 10-5 | 3 | < 0.1% |
| 1.473 × 10-5 | 1 | < 0.1% |
| 1.475 × 10-5 | 1 | < 0.1% |
| 2.94 × 10-5 | 10 | < 0.1% |
| 2.941 × 10-5 | 1 | < 0.1% |
| 2.942 × 10-5 | 1 | < 0.1% |
| 2.964 × 10-5 | 2 | < 0.1% |
| 2.975 × 10-5 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 129 | 0.3% |
| 0.9999 | 58 | 0.2% |
| 0.9998 | 33 | 0.1% |
| 0.9997 | 5417 | |
| 0.9995 | 47 | 0.1% |
| 0.9946 | 47 | 0.1% |
| 0.993 | 47 | 0.1% |
| 0.989 | 40 | 0.1% |
| 0.9863 | 47 | 0.1% |
| 0.9852 | 47 | 0.1% |
variant_class
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1952 |
| Missing (%) | 5.0% |
| Memory size | 302.1 KiB |
| SNV | |
|---|---|
| deletion | 2637 |
| insertion | 965 |
Length
| Max length | 9 |
|---|---|
| Median length | 3 |
| Mean length | 3.5169314 |
| Min length | 3 |
Characters and Unicode
| Total characters | 129096 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SNV |
|---|---|
| 2nd row | SNV |
| 3rd row | SNV |
| 4th row | SNV |
| 5th row | SNV |
Common Values
| Value | Count | Frequency (%) |
| SNV | 33105 | |
| deletion | 2637 | 6.8% |
| insertion | 965 | 2.5% |
| (Missing) | 1952 | 5.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| snv | 33105 | |
| deletion | 2637 | 7.2% |
| insertion | 965 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 33105 | |
| N | 33105 | |
| V | 33105 | |
| e | 6239 | 4.8% |
| i | 4567 | 3.5% |
| n | 4567 | 3.5% |
| t | 3602 | 2.8% |
| o | 3602 | 2.8% |
| d | 2637 | 2.0% |
| l | 2637 | 2.0% |
| Other values (2) | 1930 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 99315 | |
| Lowercase Letter | 29781 | 23.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6239 | |
| i | 4567 | |
| n | 4567 | |
| t | 3602 | |
| o | 3602 | |
| d | 2637 | |
| l | 2637 | |
| s | 965 | 3.2% |
| r | 965 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 33105 | |
| N | 33105 | |
| V | 33105 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 129096 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 33105 | |
| N | 33105 | |
| V | 33105 | |
| e | 6239 | 4.8% |
| i | 4567 | 3.5% |
| n | 4567 | 3.5% |
| t | 3602 | 2.8% |
| o | 3602 | 2.8% |
| d | 2637 | 2.0% |
| l | 2637 | 2.0% |
| Other values (2) | 1930 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 129096 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 33105 | |
| N | 33105 | |
| V | 33105 | |
| e | 6239 | 4.8% |
| i | 4567 | 3.5% |
| n | 4567 | 3.5% |
| t | 3602 | 2.8% |
| o | 3602 | 2.8% |
| d | 2637 | 2.0% |
| l | 2637 | 2.0% |
| Other values (2) | 1930 | 1.5% |
seq_region_name
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1952 |
| Missing (%) | 5.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.282317 |
| Minimum | 1 |
|---|---|
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 302.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 11 |
| median | 13 |
| Q3 | 17 |
| 95-th percentile | 17 |
| Maximum | 22 |
| Range | 21 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.069982 |
|---|---|
| Coefficient of variation (CV) | 0.41278711 |
| Kurtosis | -0.27768666 |
| Mean | 12.282317 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.81667676 |
| Sum | 450847 |
| Variance | 25.704717 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 13595 | |
| 17 | 10865 | |
| 2 | 2930 | 7.6% |
| 11 | 2329 | 6.0% |
| 3 | 1652 | 4.3% |
| 5 | 1311 | 3.4% |
| 7 | 1010 | 2.6% |
| 16 | 717 | 1.9% |
| 22 | 566 | 1.5% |
| 8 | 473 | 1.2% |
| Other values (4) | 1259 | 3.3% |
| (Missing) | 1952 | 5.0% |
| Value | Count | Frequency (%) |
| 1 | 206 | 0.5% |
| 2 | 2930 | 7.6% |
| 3 | 1652 | 4.3% |
| 4 | 370 | 1.0% |
| 5 | 1311 | 3.4% |
| 7 | 1010 | 2.6% |
| 8 | 473 | 1.2% |
| 10 | 336 | 0.9% |
| 11 | 2329 | 6.0% |
| 13 | 13595 |
| Value | Count | Frequency (%) |
| 22 | 566 | 1.5% |
| 19 | 347 | 0.9% |
| 17 | 10865 | |
| 16 | 717 | 1.9% |
| 13 | 13595 | |
| 11 | 2329 | 6.0% |
| 10 | 336 | 0.9% |
| 8 | 473 | 1.2% |
| 7 | 1010 | 2.6% |
| 5 | 1311 | 3.4% |
MSP
Text
| Distinct | 1671 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 302.1 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 270613 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 828944H |
|---|---|
| 2nd row | 828944H |
| 3rd row | 828944H |
| 4th row | 828944H |
| 5th row | 828944H |
| Value | Count | Frequency (%) |
| e656391 | 523 | 1.4% |
| 803127n | 500 | 1.3% |
| 266865y | 486 | 1.3% |
| a435161 | 470 | 1.2% |
| 246950c | 464 | 1.2% |
| 810713f | 449 | 1.2% |
| 906063n | 443 | 1.1% |
| 892831l | 441 | 1.1% |
| 327452s | 441 | 1.1% |
| 195336c | 438 | 1.1% |
| Other values (1661) | 34004 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 28183 | |
| 9 | 28074 | |
| 2 | 27084 | |
| 1 | 24279 | |
| 3 | 23197 | |
| 7 | 22564 | |
| 6 | 21216 | |
| 5 | 20453 | |
| 0 | 19985 | |
| 4 | 18734 | |
| Other values (24) | 36844 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 233769 | |
| Uppercase Letter | 36838 | 13.6% |
| Lowercase Letter | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 3182 | 8.6% |
| T | 2727 | 7.4% |
| C | 2247 | 6.1% |
| N | 2101 | 5.7% |
| P | 2030 | 5.5% |
| H | 1973 | 5.4% |
| U | 1965 | 5.3% |
| S | 1778 | 4.8% |
| K | 1708 | 4.6% |
| Y | 1644 | 4.5% |
| Other values (13) | 15483 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 28183 | |
| 9 | 28074 | |
| 2 | 27084 | |
| 1 | 24279 | |
| 3 | 23197 | |
| 7 | 22564 | |
| 6 | 21216 | |
| 5 | 20453 | |
| 0 | 19985 | |
| 4 | 18734 |
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 233769 | |
| Latin | 36844 | 13.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 3182 | 8.6% |
| T | 2727 | 7.4% |
| C | 2247 | 6.1% |
| N | 2101 | 5.7% |
| P | 2030 | 5.5% |
| H | 1973 | 5.4% |
| U | 1965 | 5.3% |
| S | 1778 | 4.8% |
| K | 1708 | 4.6% |
| Y | 1644 | 4.5% |
| Other values (14) | 15489 |
Common
| Value | Count | Frequency (%) |
| 8 | 28183 | |
| 9 | 28074 | |
| 2 | 27084 | |
| 1 | 24279 | |
| 3 | 23197 | |
| 7 | 22564 | |
| 6 | 21216 | |
| 5 | 20453 | |
| 0 | 19985 | |
| 4 | 18734 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 270613 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 28183 | |
| 9 | 28074 | |
| 2 | 27084 | |
| 1 | 24279 | |
| 3 | 23197 | |
| 7 | 22564 | |
| 6 | 21216 | |
| 5 | 20453 | |
| 0 | 19985 | |
| 4 | 18734 | |
| Other values (24) | 36844 |
RIS.
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 302.1 KiB |
| NEG | |
|---|---|
| VUS | 126 |
| POS | 86 |
| POS VUS | 11 |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.0014227 |
| Min length | 3 |
Characters and Unicode
| Total characters | 116032 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NEG |
|---|---|
| 2nd row | NEG |
| 3rd row | NEG |
| 4th row | NEG |
| 5th row | NEG |
Common Values
| Value | Count | Frequency (%) |
| NEG | 38436 | |
| VUS | 126 | 0.3% |
| POS | 86 | 0.2% |
| POS VUS | 11 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| neg | 38436 | |
| vus | 137 | 0.4% |
| pos | 97 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 38436 | |
| E | 38436 | |
| G | 38436 | |
| S | 234 | 0.2% |
| V | 137 | 0.1% |
| U | 137 | 0.1% |
| P | 97 | 0.1% |
| O | 97 | 0.1% |
| 22 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 116010 | |
| Control | 22 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 38436 | |
| E | 38436 | |
| G | 38436 | |
| S | 234 | 0.2% |
| V | 137 | 0.1% |
| U | 137 | 0.1% |
| P | 97 | 0.1% |
| O | 97 | 0.1% |
Control
| Value | Count | Frequency (%) |
| 22 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 116010 | |
| Common | 22 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 38436 | |
| E | 38436 | |
| G | 38436 | |
| S | 234 | 0.2% |
| V | 137 | 0.1% |
| U | 137 | 0.1% |
| P | 97 | 0.1% |
| O | 97 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 22 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 116032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 38436 | |
| E | 38436 | |
| G | 38436 | |
| S | 234 | 0.2% |
| V | 137 | 0.1% |
| U | 137 | 0.1% |
| P | 97 | 0.1% |
| O | 97 | 0.1% |
| 22 | < 0.1% |
| CHROM | POS | 1000gp3_eur_af | clinpred_rankscore | clinvar_id | gnomad_exomes_non_cancer_nfe_af | mutationassessor_rankscore | mutationtaster_converted_rankscore | polyphen2_hdiv_rankscore | sift_converted_rankscore | pubmed_count | frequencies_af | frequencies_gnomadg_nfe | seq_region_name | AF | GENEINFO | TISSUE | CTYPE | GT | clinpred_pred | strand | clin_sig_allele | variant_class | RIS. | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CHROM | 1.000 | -0.288 | -0.185 | -0.119 | -0.166 | -0.147 | 0.050 | 0.090 | 0.060 | 0.201 | 0.282 | -0.097 | -0.117 | 1.000 | 0.029 | 1.000 | 0.223 | 0.597 | 0.244 | 0.099 | 0.860 | 0.046 | 0.164 | 0.029 |
| POS | -0.288 | 1.000 | -0.070 | 0.114 | -0.057 | -0.028 | -0.049 | 0.403 | 0.006 | 0.148 | -0.289 | 0.028 | -0.011 | -0.281 | 1.000 | 1.000 | 0.216 | 0.583 | 0.161 | 0.073 | 0.349 | 0.109 | 0.172 | 0.024 |
| 1000gp3_eur_af | -0.185 | -0.070 | 1.000 | 0.186 | 0.677 | 1.000 | -0.430 | -0.050 | -0.299 | -0.541 | 0.884 | 0.970 | 1.000 | -0.185 | 0.107 | 0.526 | 0.053 | 0.542 | 0.506 | 0.044 | 0.883 | 0.295 | 1.000 | 0.031 |
| clinpred_rankscore | -0.119 | 0.114 | 0.186 | 1.000 | 0.290 | 0.056 | 0.590 | 0.213 | 0.702 | 0.465 | -0.124 | 0.023 | 0.185 | -0.119 | 0.000 | 0.166 | 0.558 | 0.063 | 0.473 | 0.977 | 0.116 | 0.395 | 1.000 | 0.199 |
| clinvar_id | -0.166 | -0.057 | 0.677 | 0.290 | 1.000 | 0.582 | 0.087 | -0.148 | -0.001 | -0.235 | 0.006 | 0.685 | 0.676 | -0.166 | 0.000 | 0.165 | 0.507 | 0.326 | 0.456 | 0.430 | 0.070 | 0.333 | 1.000 | 0.090 |
| gnomad_exomes_non_cancer_nfe_af | -0.147 | -0.028 | 1.000 | 0.056 | 0.582 | 1.000 | -0.431 | -0.083 | -0.315 | -0.549 | 0.888 | 0.970 | 1.000 | -0.147 | 0.108 | 0.517 | 0.174 | 0.531 | 0.536 | 0.195 | 0.857 | 0.209 | 1.000 | 0.105 |
| mutationassessor_rankscore | 0.050 | -0.049 | -0.430 | 0.590 | 0.087 | -0.431 | 1.000 | -0.329 | 0.944 | 0.837 | -0.212 | -0.693 | -0.430 | 0.050 | 0.000 | 0.233 | 0.290 | 0.363 | 0.303 | 0.515 | 0.216 | 0.250 | 1.000 | 0.126 |
| mutationtaster_converted_rankscore | 0.090 | 0.403 | -0.050 | 0.213 | -0.148 | -0.083 | -0.329 | 1.000 | -0.157 | 0.032 | 0.212 | -0.048 | -0.051 | 0.090 | 0.009 | 0.218 | 0.276 | 0.200 | 0.275 | 0.598 | 0.313 | 0.330 | 1.000 | 0.212 |
| polyphen2_hdiv_rankscore | 0.060 | 0.006 | -0.299 | 0.702 | -0.001 | -0.315 | 0.944 | -0.157 | 1.000 | 0.848 | -0.100 | -0.606 | -0.298 | 0.060 | 0.000 | 0.159 | 0.243 | 0.410 | 0.249 | 0.547 | 0.252 | 0.278 | 1.000 | 0.170 |
| sift_converted_rankscore | 0.201 | 0.148 | -0.541 | 0.465 | -0.235 | -0.549 | 0.837 | 0.032 | 0.848 | 1.000 | -0.207 | -0.680 | -0.542 | 0.201 | 0.046 | 0.344 | 0.297 | 0.189 | 0.383 | 0.554 | 0.519 | 0.241 | 1.000 | 0.157 |
| pubmed_count | 0.282 | -0.289 | 0.884 | -0.124 | 0.006 | 0.888 | -0.212 | 0.212 | -0.100 | -0.207 | 1.000 | 0.164 | 0.084 | 0.282 | 0.000 | 0.144 | 0.047 | 0.622 | 0.128 | 0.554 | 0.526 | 0.181 | 0.000 | 0.208 |
| frequencies_af | -0.097 | 0.028 | 0.970 | 0.023 | 0.685 | 0.970 | -0.693 | -0.048 | -0.606 | -0.680 | 0.164 | 1.000 | 0.956 | -0.097 | 0.063 | 0.510 | 0.070 | 0.377 | 0.486 | 0.045 | 0.704 | 0.056 | 0.052 | 0.045 |
| frequencies_gnomadg_nfe | -0.117 | -0.011 | 1.000 | 0.185 | 0.676 | 1.000 | -0.430 | -0.051 | -0.298 | -0.542 | 0.084 | 0.956 | 1.000 | -0.117 | 0.063 | 0.493 | 0.075 | 0.410 | 0.489 | 0.044 | 0.701 | 0.080 | 0.053 | 0.045 |
| seq_region_name | 1.000 | -0.281 | -0.185 | -0.119 | -0.166 | -0.147 | 0.050 | 0.090 | 0.060 | 0.201 | 0.282 | -0.097 | -0.117 | 1.000 | 0.029 | 1.000 | 0.215 | 0.598 | 0.243 | 0.099 | 0.860 | 0.046 | 0.164 | 0.028 |
| AF | 0.029 | 1.000 | 0.107 | 0.000 | 0.000 | 0.108 | 0.000 | 0.009 | 0.000 | 0.046 | 0.000 | 0.063 | 0.063 | 0.029 | 1.000 | 0.029 | 0.013 | 1.000 | 0.057 | 0.000 | 0.029 | 0.000 | 0.000 | 0.000 |
| GENEINFO | 1.000 | 1.000 | 0.526 | 0.166 | 0.165 | 0.517 | 0.233 | 0.218 | 0.159 | 0.344 | 0.144 | 0.510 | 0.493 | 1.000 | 0.029 | 1.000 | 0.309 | 1.000 | 0.317 | 0.210 | 1.000 | 0.154 | 0.059 | 0.012 |
| TISSUE | 0.223 | 0.216 | 0.053 | 0.558 | 0.507 | 0.174 | 0.290 | 0.276 | 0.243 | 0.297 | 0.047 | 0.070 | 0.075 | 0.215 | 0.013 | 0.309 | 1.000 | 0.205 | 0.810 | 0.407 | 0.030 | 0.434 | 0.094 | 0.041 |
| CTYPE | 0.597 | 0.583 | 0.542 | 0.063 | 0.326 | 0.531 | 0.363 | 0.200 | 0.410 | 0.189 | 0.622 | 0.377 | 0.410 | 0.598 | 1.000 | 1.000 | 0.205 | 1.000 | 0.208 | 0.007 | 0.056 | 0.050 | 0.171 | 0.043 |
| GT | 0.244 | 0.161 | 0.506 | 0.473 | 0.456 | 0.536 | 0.303 | 0.275 | 0.249 | 0.383 | 0.128 | 0.486 | 0.489 | 0.243 | 0.057 | 0.317 | 0.810 | 0.208 | 1.000 | 0.483 | 0.172 | 0.429 | 0.158 | 0.044 |
| clinpred_pred | 0.099 | 0.073 | 0.044 | 0.977 | 0.430 | 0.195 | 0.515 | 0.598 | 0.547 | 0.554 | 0.554 | 0.045 | 0.044 | 0.099 | 0.000 | 0.210 | 0.407 | 0.007 | 0.483 | 1.000 | 0.094 | 0.623 | 1.000 | 0.276 |
| strand | 0.860 | 0.349 | 0.883 | 0.116 | 0.070 | 0.857 | 0.216 | 0.313 | 0.252 | 0.519 | 0.526 | 0.704 | 0.701 | 0.860 | 0.029 | 1.000 | 0.030 | 0.056 | 0.172 | 0.094 | 1.000 | 0.028 | 0.098 | 0.020 |
| clin_sig_allele | 0.046 | 0.109 | 0.295 | 0.395 | 0.333 | 0.209 | 0.250 | 0.330 | 0.278 | 0.241 | 0.181 | 0.056 | 0.080 | 0.046 | 0.000 | 0.154 | 0.434 | 0.050 | 0.429 | 0.623 | 0.028 | 1.000 | 0.068 | 0.375 |
| variant_class | 0.164 | 0.172 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.052 | 0.053 | 0.164 | 0.000 | 0.059 | 0.094 | 0.171 | 0.158 | 1.000 | 0.098 | 0.068 | 1.000 | 0.016 |
| RIS. | 0.029 | 0.024 | 0.031 | 0.199 | 0.090 | 0.105 | 0.126 | 0.212 | 0.170 | 0.157 | 0.208 | 0.045 | 0.045 | 0.028 | 0.000 | 0.012 | 0.041 | 0.043 | 0.044 | 0.276 | 0.020 | 0.375 | 0.016 | 1.000 |
| CHROM | POS | REF | ALT | AF | GENEINFO | NAME | TISSUE | CTYPE | GT | 1000gp3_eur_af | clinpred_pred | clinpred_rankscore | clinvar_id | domains_count | gnomad_exomes_non_cancer_nfe_af | mutationassessor_rankscore | mutationtaster_converted_rankscore | polyphen2_hdiv_rankscore | sift_converted_rankscore | strand | sift_score | hgvsc | clin_sig_allele | pubmed_count | frequencies | frequencies_af | frequencies_gnomadg_nfe | variant_class | seq_region_name | MSP | RIS. | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 13 | 32890572 | G | A | 0.0 | BRCA2 | BRCA24/19 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000380152.8:c.-26G>A | NEG | NaN | NaN | 0.2093 | 0.2663 | SNV | 13.0 | 828944H | NEG |
| 1 | 13 | 32912299 | T | C | 0.0 | BRCA2 | BRCA24/19 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | 2.0 | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000380152.8:c.3807T>C | NEG | NaN | NaN | 0.1681 | 0.1860 | SNV | 13.0 | 828944H | NEG |
| 2 | 13 | 32913055 | A | G | 0.0 | BRCA2 | BRCA24/19 | GERMLINE | BRCA | 1/1 | NaN | NaN | NaN | NaN | 2.0 | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000380152.8:c.4563A>G | NEG | NaN | NaN | 0.9740 | 0.9997 | SNV | 13.0 | 828944H | NEG |
| 3 | 13 | 32915005 | G | C | 0.0 | BRCA2 | BRCA24/19 | GERMLINE | BRCA | 1/1 | NaN | NaN | NaN | NaN | 2.0 | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000380152.8:c.6513G>C | NEG | NaN | NaN | 0.9736 | 0.9997 | SNV | 13.0 | 828944H | NEG |
| 4 | 13 | 32929387 | T | C | 0.0 | BRCA2 | BRCA24/19 | GERMLINE | BRCA | 1/1 | 0.999006 | T | 0.00085 | 133738.0 | 2.0 | 0.999707 | NaN | 0.08975 | NaN | 0.00964 | 1.0 | 1.0 | ENST00000380152.8:c.7397T>C | NEG | NaN | NaN | 0.9758 | 0.9997 | SNV | 13.0 | 828944H | NEG |
| 5 | 13 | 32936646 | T | C | 0.0 | BRCA2 | BRCA24/19 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000380152.8:c.7806-14T>C | NEG | NaN | NaN | 0.5315 | 0.5210 | SNV | 13.0 | 828944H | NEG |
| 6 | 13 | 32906729 | A | C | 0.0 | BRCA2 | BRCA25/19 | GERMLINE | BRCA | 1/1 | 0.295229 | T | 0.00012 | 9329.0 | 2.0 | 0.278914 | NaN | 0.08975 | NaN | 0.25768 | 1.0 | 0.04 | ENST00000380152.8:c.1114A>C | NEG | NaN | NaN | 0.2494 | 0.2764 | SNV | 13.0 | 831734X | NEG |
| 7 | 13 | 32913055 | A | G | 0.0 | BRCA2 | BRCA25/19 | GERMLINE | BRCA | 1/1 | NaN | NaN | NaN | NaN | 2.0 | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000380152.8:c.4563A>G | NEG | NaN | NaN | 0.9740 | 0.9997 | SNV | 13.0 | 831734X | NEG |
| 8 | 13 | 32915005 | G | C | 0.0 | BRCA2 | BRCA25/19 | GERMLINE | BRCA | 1/1 | NaN | NaN | NaN | NaN | 2.0 | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000380152.8:c.6513G>C | NEG | NaN | NaN | 0.9736 | 0.9997 | SNV | 13.0 | 831734X | NEG |
| 9 | 13 | 32929387 | T | C | 0.0 | BRCA2 | BRCA25/19 | GERMLINE | BRCA | 1/1 | 0.999006 | T | 0.00085 | 133738.0 | 2.0 | 0.999707 | NaN | 0.08975 | NaN | 0.00964 | 1.0 | 1.0 | ENST00000380152.8:c.7397T>C | NEG | NaN | NaN | 0.9758 | 0.9997 | SNV | 13.0 | 831734X | NEG |
| CHROM | POS | REF | ALT | AF | GENEINFO | NAME | TISSUE | CTYPE | GT | 1000gp3_eur_af | clinpred_pred | clinpred_rankscore | clinvar_id | domains_count | gnomad_exomes_non_cancer_nfe_af | mutationassessor_rankscore | mutationtaster_converted_rankscore | polyphen2_hdiv_rankscore | sift_converted_rankscore | strand | sift_score | hgvsc | clin_sig_allele | pubmed_count | frequencies | frequencies_af | frequencies_gnomadg_nfe | variant_class | seq_region_name | MSP | RIS. | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 38649 | 19 | 1222268 | A | G | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000326873.12:c.920+263A>G | NEG | 1.0 | NaN | 0.7075 | 0.4664 | SNV | 19.0 | 961115F | NEG |
| 38650 | 19 | 1226772 | C | T | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000326873.12:c.*16+110C>T | NEG | 1.0 | NaN | 0.2714 | 0.2150 | SNV | 19.0 | 961115F | NEG |
| 38651 | 19 | 1226901 | G | T | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000326873.12:c.*16+239G>T | NEG | NaN | NaN | 0.2482 | 0.2143 | SNV | 19.0 | 961115F | NEG |
| 38652 | 22 | 29085060 | C | T | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | -1.0 | NaN | ENST00000404276.6:c.1542+63G>A | NaN | NaN | NaN | NaN | NaN | SNV | 22.0 | 961115F | NEG |
| 38653 | 22 | 29085138 | GGG | AGA | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 961115F | NEG |
| 38654 | 22 | 29085168 | C | G | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | 2.0 | NaN | NaN | NaN | NaN | NaN | -1.0 | NaN | ENST00000404276.6:c.1497G>C | NEG | NaN | NaN | NaN | NaN | SNV | 22.0 | 961115F | NEG |
| 38655 | 22 | 29085257 | A | G | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | -1.0 | NaN | ENST00000404276.6:c.1462-54T>C | NaN | NaN | NaN | NaN | NaN | SNV | 22.0 | 961115F | NEG |
| 38656 | 22 | 29091300 | T | C | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | -1.0 | NaN | ENST00000404276.6:c.1260-70A>G | NaN | NaN | NaN | NaN | NaN | SNV | 22.0 | 961115F | NEG |
| 38657 | 22 | 29130300 | C | T | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 1/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | -1.0 | NaN | ENST00000404276.6:c.319+91G>A | NEG | NaN | NaN | 0.2734 | 0.2807 | SNV | 22.0 | 961115F | NEG |
| 38658 | 22 | 29130813 | GAAAAAAAAAAAAA | GAAAAAAAAAAAAAA | NaN | NaN | BRCA84/20 | GERMLINE | BRCA | 0/1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 961115F | NEG |
Most frequently occurring
| CHROM | POS | REF | ALT | AF | GENEINFO | NAME | TISSUE | CTYPE | GT | 1000gp3_eur_af | clinpred_pred | clinpred_rankscore | clinvar_id | domains_count | gnomad_exomes_non_cancer_nfe_af | mutationassessor_rankscore | mutationtaster_converted_rankscore | polyphen2_hdiv_rankscore | sift_converted_rankscore | strand | sift_score | hgvsc | clin_sig_allele | pubmed_count | frequencies_af | frequencies_gnomadg_nfe | variant_class | seq_region_name | MSP | RIS. | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 13 | 32893206 | TT | T | 0.0 | BRCA2 | BRCA160/21 | SOMATIC | BRCA | 0/0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | ENST00000380152.8:c.68-4del | NaN | NaN | NaN | NaN | deletion | 13.0 | A435161 | NEG | 2 |